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Socializing Autonomous Units with the Reflexive 
Game Theory and Resonate-and-Fire neurons 

Sergey Tarasenko 


Abstract —In this study the concept of reflexia is applied to 
modeling behavior of autonomous units. The relationship between 
reflexia, on the one hand, and mirror neuron system and perception 
of emotions, on the other hand, is introduced. The main method of 
using reflexia in a group of autonomous units is Reflexive Game 
Theory (RGT). To embody RGT in a group of autonomous agents a 
communication system is employed. This communication system uses 
frequency domain multiplexing by means of Izhikevich’s resonate- 
and-flre neural models. The result of socialization of autonomous 
units by means of RGT and communication system is illustrated in 
several examples. 

Keywords —^reflexive game theory, multiagent systems, resonate- 
and-flre neurons 

I. Introduction 

The notion of reflexia in the psychological context was 
fisrt introduced by Lefebvre in late 60s (TJ, Q, lO. 

Reflexia means projection of the external world on one’s 
mental state. More specifically, if a human being stands in 
the field of barley he/she can imagine onesself standing in 
the field of barley from the 3rd person’s perspective. Thus, 
preserving the egocentric point of view humans are capable 
of imaganing their own allocentric representation. 

Therefore, the gist of refiexia is an ability to imagine self 
perception in the allocentric reference frame (externaj^ point 
of view) being operating in the one’s egocentric reference 
frame. 

Refiexia is an ability to penetrate into the deeper layers of 
one’s psychological state. An abstract example of penetration 
into the deeper layers is when subject a can imagine another 
person (subject 6), who is imaganing subject a, the world 
around and himself imagaging it. 

The overall results about the justification of refiexia and its 
application for modeling human behavoir has been summa¬ 
rized by Vladimir Lefebvre, the principle investigator in this 
field, in his book “Algebra of Consciece” ifTTIl . |[T6l . 

Here we would like to refer two more examples of potential 
implementation of refiexia: mirror neuron system and percep¬ 
tion of emotions. 

Mirror neuron system: The key concept of the Mirror 
Neuron System discovered by Rizzolatti and his colleagues 
SI, 0 is that there are neurons in primate brain that activate 
in both cases when primate is doing a particular action itselt or 
observe someone else doing the same action. Therefore mirror 
neuron system translates external state of another agent into 
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^Term egocentric means perception from the 1st person perspective, while 
term allocentric means the perception from the 3rd person perspective. 

^Extcnal means here outside of ones body or any other feasable entity 


the internal state of the current agent. Therefore primates can 
repeat the observed action. This functionallity is very close to 
the notion of reflexia. 

Perception of emotions: Usually, the emotions are character¬ 
ized by some physiological patterns of body activity on the one 
hand, and external expression by face mimic or gestures on the 
other hand. A reproduction by the onesself of physical part of 
emotional pattern, i.e., just making an angry face, can elicit the 
anger as emotional state itself 0, m This suggests that there 
is a mechanism that helps mapping someones internal state 
to the self. For instance, Edgar Allan Poe, in his story “The 
Purloined Letter”, describes how one character is attemping 
to understand the intensity of emotional experience of another 
character by self-mimicing (or imitating) facial expressions of 
another one. 

Most recently the principle of refiexia have been reconsid¬ 
ered in the shape of the Refiexive Game Theory (RGT). The 
expectations are that since refiexia is intrinsic ability of human 
being and the principles and models proposed by Lefebvre 
have been proved to be true, the Refiexive Game Theory can 
deliver the human-like decision-making processes. 

In contrast to the Game Theory based on purely utilitary 
and rational principle, the Refiexive Game Theory is based 
on the human’s decision making. The Conner stone of the 
Refiexive Game Theory is the egoism forbiddeness principle, 
while Game Theory is build upon pure egoism stemming from 
(Min Loss - Max Utility) principle. 

Most recently RGT has been applied for theoretical model¬ 
ing of human-robot groups ca, in which robots were sucess- 
fuly refraining people from doing risky actions. However, the 
issue of how to embody RGT into the system of robotic agents 
is still an open questions. 

The goal of this paper is to illustrate how application 
of RGT algorithms for modeling purely robotic groups can 
literary humanize the robots by given them human sensitivity 
instead of cold rationality, which is usually attributed to 
machines. A certian structure of communication system to 
enable robots “talk” to each other is proposed. 

IL Brief Overview of the Reflexive Game Theory 

A. Representation of groups: graphs, polynomials, stratifica¬ 
tion tree and decision equation 

The exhaustive desription of the Refiexive Game Theory 
(RGT) and tutorial of RGT application have been presented 
by Lefebvre 0,0, [?]. Here, we present a brief overview 
of RGT enough to understand its basic concept and formulate 
the tasks solved in this paper. 
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The RGT deals with groups of abstract subjects (individuals, 
humans, autonomous agents etc). Each subject is assigned 
a unique variable {subject variable). Any group of subjects 
is represented in the shape of fully connected graph, which 
is called a relationship graph. Each vertex of the graph 
corresponds to a single subject.The name of each vertex is 
a unique subject variable. 

The RGT uses the set theory and the Boolean algebra as the 
basis for calculus. Therefore the values of subject variables are 
elements of Booleans algebra. 

All the subjects in the group can have either alliance or 
conflict relationship. The relationships are illustrated with 
graph ribs. The solid-line ribs correspond to alliance, while 
dashed ones are considered as conflict. Eor mathematical 
analysis alliance is considered to be conjunction (multipli¬ 
cation) operation (•), and conflict is defined as disjunction 
(summation) operation (+). 

The decomposable relationship graphs m, El, Co) can be 
presented in the analytical form of a corresponding polyno¬ 
mial. Any relationship graph of three subjects is decompos¬ 
able. Consider three subjects a, b and c. Let subject a is in 
alliance with other subjects, while subjects b and c are in 
conflict (Eig. [^. The polynomial corresponding to this graph 
is ab + c. 

Regarding a certain relationship, the polynomial can be 
stratified (decomposed) into sub-polynomials El, El- Each 
sub-polynomial belongs to a particular level of stratification. 
If the stratification regarding conflict (alliance) was first built, 
then the stratification regarding alliance (conflict) is imple¬ 
mented on the next step The stratification procedure finalizes, 
when the elementary polynomials, containing a single variable, 
for each variable are obtained after a certain stratification 
step. The result of stratification is the Polynomial Stratification 
Tree (PST). It has been proved that each non-elementary 
polynomial can be stratified in an unique way, i.e., each non- 
elementary polynomial has only one corresponding PST (see 
Col considering one-to-one correspondence between graphs 
and polynomials). Each higher level of the tree contains 
polynomials simpler than the ones on the lower level. Eor the 
purpose of stratification the polynomials are written in square 
brackets. The PST for polynomial a6 + c is presented in Pig|^ 

We omit the branches of the PST and from each non- 
elementary polynomial write in top right corner its sub- 
polinomials. The resulting tree-like structure is called a di¬ 
agonal form^, 131, ifTTl . ifTbl . Consider the diagonal form 
corresponding to the PST presented in Pig. 

[a\[b\ 

[ab] +[c] 

[ab + c] 

We introduce the universal set, which contains the elemen¬ 
tary actions. Eor example, these actions are actions a and 
p. The Boolean algebra of the universal set includes four 
elements: 1 = {a^P}, {a}, {/3} and the empty set 0 = {} 
= 0. The diagonal form is considered to be a function defined 
on the Boolean algebra. 

Accroding to definition given by Lefebvre ifTTIl . we present 


c 


a ^ 



Fig. 1. Relationship graph. 


[aHb] 

[ab+c] 


Fig. 2. Polynomial Stratification Tree. Polynomials [a],[b] and [c] are 
elementary polynomials. 


here exponential operation defined by formula 

= p + w, 


( 1 ) 


where W stands for negation of W is, B, [?]. It can 
be shown that function in eq. 0 is equivalent to implication 
function ca. 

This operation is used to fold the diagonal form. During 
the folding, round and square brackets are considered to be 
interchangeable. Next we implement folding of diagonal form 
of polynomial ab + c: 


[a][b] 

m +[c] 

[ab + c] 


= ab c 


B. The Decision Equation: definition and solution 

Each subject in the group should choose an alternative 
(element) of the Boolean algebra. We consider the decision 
equations. Each equation corresponds to a single subject. The 
solution of such equation defines the choice of each subject 
in the group. This equation contains subject variable in the 
left-hand side and the result of diagonal form folding in the 
right-hand side: 


a = ab c 

(2) 

b = ab^ c 

(3) 

c = ab -\- c 

(4) 

To solve the decision equation, one should first transform it 
into canonical form (81, ifTSl defined as: 

X = Ax + Bx , 

(5) 


where x is the subject variable, and A and B are some sets. 

This equation has solution if and only if the set B is 
contained in set A: A D 5. If this requirement is satisfied, then 
eq.([^has at least one solution from the interval A D x D 5 
HI, lEl, [?]. Otherwise, the decision equation has no solution, 
and it is considered that subject cannot make a decision. 
The state of inability to make decision (choice) is called a 
frustration. The explicit tranformation of decision equation 
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TABLE I 

Influence Matrix 



a 

b 

c 

a 

a 

{«} 

{/?} 

b 

{/3} 

b 

{/?} 

c 

{/3} 

{/3} 

c 


for subjects a, b and c into canonical form is consider in IT^ . 
Therefore, here we only provide the resulting canonical forms: 


a = (6 + c)a + cd 

(6) 

6 = (a + c)b + cb 

(7) 

c = cF abc 

(8) 


Next we consider two tasks, which can be formulated 
regarding the decision equation in the canonical form and 
briefly discuss methods to solve each task. 

C The Forward and Inverse Tasks 

In this section, we only illustrate the introductory examples 
of the Forward and Inverse task of the RGT. The comprehen¬ 
sive explaination how to solve the forward task can be found 
here M, IS, [?]. The issues regarding the Inverse task are 
discussed in details in m. 

The Forward Task. The variable in the left-hand side of 
the decision equation in canonical form is the variable of the 
equations, while other variables are considered as influences 
on the subject from the other subjects. The forward task is 
formulated as a task to And the possible choices of a subject 
of interest, when the influences on him from other subjects 
are given. 

The mutual influences in the forward task are presented in 
the Influence matrix (Table |I]). The main diagonal of influence 
matrix contains the subject variables. The rows of the matrix 
represent influences of the given subject on other subjects, 
while columns represent the influences of other subjects on the 
given one. The influence values are used in decision equations. 

I illustrate solution of the forward task using subjects a, b 
and c. By using canonical forms of decision equations for each 
subject (eqs. [6][^ and Influence matrix, we obtain the choice 
of each subject: 

subject a: a = ({/3} -h {/3})a + a ^ a = {/3}a + a. 

subject b: b = b ^6 = 6 + {fl}b. 

subject c: c = c-\- ^ c = c -h {{fl} F 

{(t})c => c = 1 . 

Equation for subject a does not have any solutions, since set 
A = {(3} is contained in sot B = 1: A c B. Therefore, subject 
a cannot make any decision. Therefore he is considered to be 
in frustration state. 

Equation for subject b has at least one solution, since A = 
1 = 3 B = {f3}. The solution belongs to the interval 

^ ^ b ^ {/3}. Therefore subject b can choose any alternative 
from Boolean algebra, which contains alternative {(3}. These 
alternatives are 1 = {a, (3} and {13}. 


Equation for subject c turns into equality c = 1. This is 
possible only in the case, when A = B = 1. 

The solution of the Forward task can be algorithmized as 
follows: 

1) formalize of actions in terms of Boolean algebra of 
alternatives; 

2) represent a group as relationship graph; 

3) represent relationship graph 

The Inverse Task. The inverse task is formulated as a task 
to And all the simultaneous (or joint) influences of all the 
subjects together on the subject of interest that result in choice 
of a particular alternative or subset of alternatives. We call the 
subject of interest to be a controlled subject. 

Let subject a is the controlled subject and a* is a flxed value 
representing an alternative or subset of alternatives, which 
subjects b and c want subject a to choose. By substituting 
subject variable a in decision equation for flxed value a*, we 
obtain the influence equation: 

a* = (6 + c)a* +c^, (9) 

In contrast to the decision equation, which is equation of 
a single variable, the influence equation is the equation of 
multiple variables. We need to And all the joint influences of 
subjects b and c in form of pairs (6, c). Let a* = {a}, then 
we need to solve the system of equations 


I 6 + c= [a] 

\c={a} 

Consequently, we have to solve equation 


( 10 ) 


( 11 ) 


and to And all the pairs (6, c), results in solution a* = {a}. 


These pairs are solutions of the system (10). Therefore, we 
run all the possible values of variable b and check if the first 
equation of the system in) turns into true equality: 

6 = 1 : 1 -f {(t} = 1^17^ 
b = {a} : [a] -h [a] = {a} => {a} = [a] 
b = 1/3} : 1/3} + {a} = 1 ^ 1 {a}; 

6 = 0:0 + {a} = {a} ^ {a} = {a}. 

Therefore, out of four possible values, only two values 
{(t} and 0 are appropriate. Thus, we obtain two pairs (6, c): 
({(t},{(t}) and ({(t}, 0). These pairs represent the strategies 
of reflexive control. 

This conludes the overview of the Reflexive Game Theory. 
As a final remark, we show Basic Control Schema of Abstract 
Individual (BSCAS) (LigJ^. The detailed decription of the 
BSC AS can be found in ifl^ . 

Summarizing this section, I emphasize the information 
needed for the RGT to be applied. Lirst of all, we need 
to define the universal set of actions and the corresponding 
Boolean algebra. Second, we need to know the structure of a 
group. Linally, we need to know the mutual influences of the 
group members. 

This imposes the following requirements of functionallity 
of autonomous units. Autonomous units have to be able to 1) 
code each alternative in the Boolean algebra and relationships; 
2) trasmit this information to each other in a manner that each 
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Fig. 3. The Basic Control Schema of an Abstract Subject (BSCAS). 


unit could ‘understand’ the information address to it and at 
the same time all the units should be aware about all the 
information transfered from any unit to any unit. 

How to code and transfer this information in the groups of 
autonomous units is the main question discussed in the rest 
of this study. Therefore, the material presented hereafter is 
dedicated to the matter of how an autonomous unit can obtain 
the required information. 

III. Introducing Communications between 
Autonomous Units 


Euler method, we striaghtforwardly obtain difference equation 
( p^ from differential equation ( p^ : 

z(t + r) = z{t) H- r{b + iuj)z{t), (14) 


where r is a small time step. We set r to 0.005 in all our 
simulations. 


Iterating difference equation (14) with 2;(0) = 2 ^ 0 , we can 
approximate the analitical solution of eq.(p^ and, conse¬ 
quently, of system •tg- 

Now to obtain value of voltage variable y{t) at time t, we 
only need to take imaginary part of z{k). 

The real nearons produce a spike, once value of y{t) equal 
to or exceeds some preset threshold. However, this feature is 
not provided by the linear model. 

Therefore we slightly modify the original model by adding 
the ‘spiking’ condition: 


y{t) > threshold y{t) = 1.5 
y{t + T) = 0.1 


In this study, we use the abstract autonomous units, which 
are capable of communicating with each other in the frequency 
domain. Therefore, these units can distinguish between several 
frequencies. Each frequency can be used as a carring frequency 
to transmit the information between the autonomous units. 

To be capable of distinguishing between various frequencies 
the autonomous units are supplied with frequency selective 
devices - resonators. There are many possible implimentations 
of the resonators. In this study, we suggest to used the 
Resonate-and-Fire linear neural model proposed by Izhikevich 
ca. The choice of this model is justified by its simplicity in 
implementation and low computational complexity. Next, we 
provide brief overview of the Resonate-and-Fire linear model. 

A. Resonate-and-fire neurons 

The original linear model of resonate-and-fire neuron pro¬ 
posed by Izhikevich ina is described by the system of two 
differential equations: 

{x = bx-u:y ^^ 2 ) 

1 ^ = ujx -h by 

where x is current-like or recovery vairable, and y is voltage¬ 
like or action potential variable, in terms of neuroscience. Both 
variables x and y are functions of time: x = x{t) and y = 
y{t). Notation x means derivative of x with respect to time 
t: X = dxjdt. Paramater cc is the eigen-frequency, which is 
preferred or resonant frequency of the system, and represents 
frequency of subthreshold oscillations; parameter b is ananalog 
of damping factor in the linear damped oscillator. The value 
of paramter b is set to -0.1 throughout our simulations. 

For the purpose of numerical intergration the system 
can be transformed into the form: 

z = {b iuj)z . (13) 

Then variables x and y are real and imaginary parts of 
complex variable z, respectively. Using eq.(p^ and first-order 


A threshold value is to 1 throughout the simulations. There¬ 
fore, the ultimate model of resonate-and-fire neuron used in 
this study is described by the system 


z{t + r) = z{t) -h r{b + iu)z{t) 

< if y{t) > 1 y{t) = 1.5 (16) 

^y{t^r) = 0.1 


We present the sample dynamics of two resonate-and- 
fire neurons, described by system with different eigen- 
frequencies uji = 37r/2 and cc 2 = 47r/3 in Fig. 

It is illustrated that neurons with eigen-frequency cci spikes 
for the series of pulses with the same frequency and does 
not respond to the series of pulses with frequency 002 . The 
same is true for the second neurons regarding shift in roles of 
frequencies cci and uj 2 - 

Thus, we have described the mechanism of frequency se¬ 
lectivity. This can be used to enable multiple neurons to talk 
to each other via the same medium by means of Frequency 
Domain Multiplexing. 

However, the linear model has other important properties. 
The inhibitory pulses can also make resonate-and-fire neurons 
to spike, if the inhibitory pulses are applied with the eigen- 
frequency of the neuron (Fig. [^. 

However, it is not the final feature of this model. It is 
possible to make the neuron fire with series of pulses of 
different magnitudes. For example, let the magnitudes of the 
first, second and third pulses are 0.1, 0.4 and 0.6, respectively 
(Fig. 6). The same result will occur for the inhibitory pulses 
(Fig.Q. 

Since the neurons are selective to a certian frequency, it is 
possible to transfer signals of several frequencies through the 
same communication channel. 

This concludes the description of communication system. 
Next we consider the framework to manage the groups of 
autonomous units. 
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Neuron with eigen-frequency = 3 ji/2 




Neuron with eigen-frequency = 3 ji/2 




Fig. 4. The Resonate-and-Fire neurons. Top: solid line illustrates resonanse 
with the input frequency uoi = 37r/2, dashed line shows only subthreshold 
oscillations meaning that neuron does not respond to the frequency 002 = 
47r/3. Bottom: solid line illustrates resonanse with the input frequency UJ 2 = 
47r/3, dashed line shows only subthreshold oscillations meaning that neuron 
does not respond to the frequency cui = 37r/2. The green and red vertical lines 
indicate the equal input pulses of magnitude 0.4. Green and red pulses are 
provided with frequencies uoi = 37r/2 and 002 = 47r/3, respectively. Each series 
of pulses starts 1 ms after the system onset. Threshold is set to 1. Parameter 
b is -0.1. 


Neuncn with eigen-frequency ki.| = 3ji/2 




Fig. 5. The selective responses of resonate-and-fire neurons to the series of 
equal inhobitory pulses (magnatude -0.4). 


IV. Building the Groups of Autonomous Units 

In this section we introduce a sketch of communication 
system to socialize autonomous units. 

A. Information Coding 

Each autonomous unit has several resonators tuned to par¬ 
ticular frequencies. Each resonator corresponds to a certian 
unit in the group. Therefore, the total number of resonators 
equals the total number of units in the group. Eor each unit, we 
reserve its unique frequency. Once the information is obtained 
from resonators with a frequency accosiated with this unit, it 
is that this unit is an addressee. 


Fig. 6. The selective responses of resonate-and-fire neurons to the excitatory 
series {0.1, 0.4, 0.6}. 


Neuron with eigen-frequency a).| = 3ii/2 




Fig. 7. The selective responses of resonate-and-fire neurons to the inhobitory 
series (-0.1, -0.4, -0.6}. 


Using the resonate-and-fire neurons presented in the pre¬ 
vious section, we can transfer different types of information 
throught the network of autonomous units. In fact, we can 
transmit two types of information coded by 1) the kind of 
pulses (exictatory vs inhibitory), and 2) selecting different 
magnitude of pulses in the series. 

These two types of information are enough to model the 
groups in the Refiexive Game Theory. We consider a certain 
frequency to be the unique identifier of the autonomous unit 
in the group. Next, if the series pulses contains the excita¬ 
tory impulses, it is considered that two units are in alliance 
relationships, or they are in conflict otherwise. Einally, we can 
define a certian alternative of the Boolean Algebra by a certain 
seris of pulses. 

B. Receiving Informaiton in the Group 

So far, we understand how a certian unit can recieve 
information. The question remains how autonomous unit can 
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Fig. 8. Receiving messages in the network. 


understand where the signal conies from or which unit sends 
it? 

For this purpose, we reserve a series of equal pulses 
{0.4,0.4,0.4}. Hereafter, we refer to the series of pulses as 
code or message. In particular, we call the code {0.4,0.4,0.4} 
to be identification-code (ID-code), if this code is transmitted 
by a certain unit on its own preferred frequency. 

Suggest, we have three units a, b and c. Each unit is char¬ 
acterized by its preferred frequency: uja = 37r/2, = 47r/3 

and ujc = 57r/3. 

If autonomous unit a with eigen-frequency uJa decides to 
send some code to another unit, it first sends ID-code on the 
frequency cca- Therefore, the corresponding neuron spikes in 
each autonomous units, and units b and c ’understand’ that 
unit a wants to send a code. This can be considered as unit a 
attracts attention of the other units in the group. Then, after a 
short delay (0.5 sec) after spike on the frequency uJa, unit cl 
sends a certain code on the frequency uosubject, where subject 
can be either b or c. 

As an example, we consider that unit a wants to send 
its ID-code to unit c. Therefore, unit a first sends ID-code 
10.4,0.4,0.4} on the frequency Ua to attract attention of other 
units: in units b and c the channels with frequency Ua show a 
spike (Fig.[^ top). Then, 0.5 sec after a spike on the frequency 
uja , unit CL sends the ID-code on the frequency cCc- in units 
b and c the channels with frequency uJc show a spike (Fig. 
center). Since, frequency uJc is the frequency reserved for unit 
c, unit c receives ID-code from unit a. At the same time 
channel with frequency shows no spike (Fig. bottom), 
and unit b ‘understands’ that ID-code is not addressed to it. 

This way each unit in the groups can become completely 
awear about the whole information transmitted between any 
two units. Therefore, such communication schema provides all 
necessary information for application of the Reflexive Game 
Theory. 

C. How to Install Relationships in the Group 

Now we consider how to install relationship between units. 
Each unit decides on its own, which type of relationship 


TABLE II 

Internal state oe each unit regarding relationships with 

OTHERS 



a 

b 

c 

a 

- 

0.81 

0.92 

b 

0.63 

- 

0.12 

c 

0.09 

0.27 

- 


(conflict or alliance), it wants to install with other units. For 
example, we consider that the relationships are decided at 
random, meaning that at the very begining the units do not 
have any infomation about each other, except for the preferred 
frequencies. Therefore, this condition can be assumes as 
guessing. The human guessing based on no prior information 
has been describe from both theoretical and experimental 
points of views in ifT^ . ifTTIl . In the case of two options, one 
option (positive pole) is chosen with probability p ^ 0.61, 
while another option (negative pole) is chosen with probability 
1 — p « 0.39. The concept of option’s polarity has been first 
introduced by Lefebvre ca. We consider alliance relationship 
to be positive pole and conflict relationship to be a negative 
pole. 

The alliance and conflict relationships are coded with codes 
10.4,0.4,0.4} and {-0.4,-0.4,-0.4}, respectively. We call codes 
{0.4,0.4,0.4} and {-0.4,-0.4,-0.4} to be alliance and conflict 
codes, respectively, if they are transmitted NOT on the pre¬ 
ferred frequency of the unit, which sends it. The alliance and 
conflict codes are chosen with probabilities 0.61 and 0.39, 
respectively. 

However, to install the relationship, the decisions of both 
units are needed. In other words, since units chosen the type 
of relationships independently from each other, it is possible 
that, for example, unit a sends conflict code to unit b, but unit b 
sends alliance code to unit a. Therefore, each unit has decided 
its own relationship, which is different from the one chosen 
by counterparty. Thus, the codes are different. We define that 
the alliance relationship is installed if and only if both units 
send alliance code to each other, the conflict relationship is 
installed otherwise. Thus, the relationship between units a and 
b is conflict. 

If we consider codes {0.4,0.4,0.4} and {-0.4,-0.4,-0.4} as 
logic 1 and 0, respectively, the alliance relationship can be 
defined as logic conjunction (AND) function, and conflict 
relationships as disjunction (OR) function. 

Example 1. Let us generate a group of three units with 
randomly chosen relationships. We use a uniform random 
number generator with interval (0,1). If the value of random 
variable exceeds 0.61, unit x generates conflict code {-0.4,- 
0.4,-0.4} to some other unit, otherwise it generates alliance 
code {0.4,0.4,0.4} (Table |^. The rows of Table [^contain the 
decisions about relationships that each unit generated itself, 
but have not yet transmitted to other units. Therefore, Table 
lU illustrates internal state of each unit. This internal state is 
yet not known by other units in the group. 

According to Table ^ unit a will send conflict code to both 
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TABLE III 

Transmitted relationship codes 


TABLE IV 

INELUENCE MATRIX 



a 

b 

c 

a 

- 

0 

0 

b 

0 

- 

1 

c 

1 

1 

- 



a 

b 

c 

a 

a 

{«} 

{} 

b 

{«} 

b 

{/3} 

c 

{/3} 

{} 

c 


units b and c (Fig. [^. Unit b will send conflict code to unit 


10) . Unit c will send alliance 

11 ) . 


and alliance code to unit c (Fig. 
code to both units b and c (Fig. 

After the codes have been transmitted from each unit to 
each unit, the information from Table [II] become available to 
each unit in the group. Using the correlation between conflict 
and alliance codes and logic 1 and 0, we can rewrite Table [I^ 
Therefore, using informaiton from Table |I^ together with 
conjunction and disjunction functions, we obtain the relation¬ 
ships installed between the units: units b and c are in alliance, 
while unit a is in conflict with both units b and c. 

Since, the informaiton about the relationships between units 
is now known by each unit in the group , we can construct 
the relationship graph (Fig. of the Reflexive Game Theory 
and obtain the polynomial corresponding to this graph, which 
is a + be. 


D. How to Transfer Information about the Influences 

In this section, we illustrate how to transmit influences of 
unit on each other. We use the same approach described in 
the previous section. The only difference is that instead of the 
alliance or conflict codes, unit transmits some code associated 
with a particular alternative. 

Example 2. Suggest, we have Boolean algebra of four 
alternatives: 1 = {a}, {fl} and 0 = {}. We arbitrary 

assign a certain code to each alternative: 

1) code {0.2, 0.3, 0.7} corresponds to alternative 1 = 

2) code {0.7, 0.3, 0.2} corresponds to alternative 0 = {}; 

3) code {0.5, 0.2, 0.5} corresponds to alternative {a}; and 

4) code {0.3, 0.6, 0.3} corresponds to alternative {fl}. 

To make reference easier, we refer to each code as ‘altem- 
tive name’-code: 

code {0.7, 0.3, 0.2} is called unit-code; 

code {0.2, 0.3, 0.7} is zero-code; 

code {0.5, 0.2, 0.5} is referred as {a}-code; and 

code {0.3, 0.6, 0.3} is {/3}-code. 

We assume that unit a makes influences {a} and 0 = {} on 
units b and c, respectively (Fig. 12). Unit b makes influence 
{(t} on both units a and c (Fig. 13). Unit c makes influences 
{(t} and {/3} on units a and b, respectively (Fig. 14). 


E. RGT Inference 

Therefore, after all the influences have been transmitted, we 
obtain the influence matrix (Table [rv| ). 

Thus, each unit now has complete information to apply the 
RGT inference schema based on the decision equations ca. 


The canonical form of decision equation for unit a is a = 
a + bed and the corresponding solution interval is 1 3 a 3 6c. 
The canonical form of decision equation for unit 6 is 6 = 
(a+c)6-l-a6 and the corresponding solution interval is (a+c) 3 
b ^ a. The canonical form of decision equation for unit c is 
c = (a + 6)c -|- ac and the corresponding solution interval is 
(a + 6) ^ c 3 a. 

Under the given influences, the choice of unit a is define 
by the interval 1 ^ a 3 {}. The solution interval for unit b 
turns into equality b = {a}. The choice of unit a is define by 
the interval {/3} ^ a 3 {}. 


V. Cooperative Behavoir oe the Autonomous Units 
Controlled by the RGT Algorithms 

Until now, we have briefly described the gist of the RGT, 
communication system for the autonomous units and illus¬ 
trated how the units can use the information obtained by means 
of the communication system for the RGT inference. 

In this section we consider how units can make mutual 
influences in order to achieve a particular goal in a cooperative 
behavior task. 

Example 3. Consider four autonomous units (robots). Let 
these robots are functioning by using the electric accumulators. 
There’s a chaging pool in the restricted perimeter (castle), 
which has only one entrance (gate). There are three robots 
a, b and c in the castle, and one robot d outside the perimeter. 

Let robots a, b and c are in alliance with each other and in 
conflict with robot d. Robots a, b and c are locked inside the 
perimeter, but they can open the gate if each robot agrees so. 
The power source has a limited capacity is only 25% full. 

The accumulators of robots a, b and c are 50% full, 
while accumulator of robot d is only 10% full. The power 
source should be regenerated. However, the time left until 
regeneration exceed the life-time of the robot d’s accumulator. 

Consider ‘open the gates’ is action 1 (unit-code). Then 
Boolean algebra of alternatives contains two elements 1 - to 
open the gates, and 0 - ‘don’t open’. 

From the point of view of Game Theory, it is clearly out 
of utility to share the power source with ‘exhausted’ enemy 
robot, but let’s have look what should happen if RGT inference 
is applied. 

The diagonal form transformation is present as follows: 
[a]m 

[abc\ -^[d] 


[abc + d] 


= abc + d 

























Unit a transmits ID and Conflict codes to Unit b 



Unit a transmits ID and Conflict codes to Unit c 



Fig. 9. Activity of unit a. 


Unit b transmits ID and Conflict codes to Unit a 



Unit b transmits ID and Alliance codes to Unit c 



Fig. 10. Activity of unit b. 


Unit c transmits ID and Alliance codes to Unit a 



Fig. 11. Activity of unit c 
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Unit a transmits ID code and {a}-code to Unit b - channel with oo^ = 371/2 




Fig. 12. Transmission of influnces produced by unit a. 



Fig. 13. Transmission of influnces produced by unit b. 



Unit c transmits ID code and {p}-code to Unit b - channel with 00^ _ 57 t /3 



Fig. 14. Transmission of influnces produced by unit a. 
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a c 


Fig. 15. Relationship graph for Example 3. 

The resultant decision equations in canoncal forms are: 


a = {be + d)a + da 

(17) 

b = {ad + c)b -b db 

(18) 

c = {ab + d)c + dc 

(19) 

d = df- abed 

(20) 


All three robot inside the perimeter are willing not to open 
the gates and they inlfuence by zero-code on each other and 
on robot d. Then the deicision intervals for each robot are 

(6c + d) 3 a 3 d ^ (0 + d) 3 a 3 d ^ a = d; 

{ac -\-d)^b^d^{0-\-d)^b^d^b = d; 

{ab -\-d)^c^d^{0-\-d)^b^d^c = d; 

1 D d D a6c ^ 1 D d D 0. 

Therefore, the decision of all three robot inside the perime¬ 
ter is defined by the infiuence of robot d. Therefore, it robot 
d makes infiuence 1, then all three robot will agree to open 
the gate, and since this is required condition, they will open 
the gate. On the other hand, robot d has a freedom of choice. 

Therefore, if robot d asks for help, other three robots should 
open the gate and allow access to the power supply source. 

VI. Discussion 

In this study, we have presented the structure of autonomous 
units, which allows these units to install communication with 
each other and create groups. As the basis for communication 
network, we use resonate-and-fire neurons, which are em¬ 
ployed as signal receivers. The main feature of resonate-and- 
fire neurons is their selectivity to a particular frequency, which 
is eigen-frequency of the neuron. Therefore, it is possible to 
send different codes through the same network and be sure 
that each unit understands the message addressed exclusisvely 
to it. I do not discuss here physical mechanisms of generating 
signals. 

We illustrated how it is possible to arrage a group of three 
units as a communication network. I also showed how to code 
different messages such as sender identification and Boolean 
algebra alternatives. 

We concluded with examples of how a simple group can 
be arranged based on the information about relationships 
between units and showed how to transmit the information 
about influences in the group. Thus, having received the 
information about the structure of the group and the mutual 
infiuences, each autonomous unit can apply algorithms of 
RGT inferences. Thus, each unit can make both its own 
choice and also predict the possible choices of other members 
of the group. Therefore, the fusion of the proposed sketch of 
communication network with the RGT inference allows to 


obviously demostrate principles of the RGT on the particular 
autonomous units. 
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